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Abstract 

The Lovasz Local Lemma [5] (LLL) is a powerful result in probability theory that 
states that the probability that none of a set of bad events happens is nonzero if the 
probability of each event is small compared to the number of events that depend on it. 
It is often used in combination with the probabilistic method for non-constructive exis- 
tence proofs. A prominent application is to fc-CNF formulas, where LLL implies that, 
if every clause in the formula shares variables with at most d < 2 k je other clauses then 
such a formula has a satisfying assignment. Recently, a randomized algorithm to effi- 
ciently construct a satisfying assignment was given by Moser [12J. Subsequently Moser 
and Tardos [13] gave a randomized algorithm to construct the structures guaranteed 
by the LLL in a very general algorithmic framework. We address the main problem left 
open by Moser and Tardos of derandomizing these algorithms efficiently Specifically, 
for a /c-CNF formula with m clauses and d < 2 fc /( 1+e )/e for some e € (0, 1), we give an 
algorithm that finds a satisfying assignment in time 0(m 2 ( 1+1//e )). This improves upon 
the deterministic algorithms of Moser and of Moser- Tardos with running time mp( k > 
which is super polynomial for k = to(l) and upon other previous algorithms which work 
only for d < 2 fc / 16 /e. Our algorithm works efficiently for a general version of LLL under 
the algorithmic framework of Moser and Tardos |13j . and is also parallelizable, i.e., has 
poly logarithmic running time using polynomially many processors. 
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1 Introduction 



The Lovasz Local Lemma [5] (LLL) is a powerful result in probability theory that states 
that the probability that none of a set of bad events happens is nonzero if the probability of 
each event is small compared to the number of events that depend on it. LLL is often used 
in combination with the probabilistic method to prove the existence of certain structures. 
For this, one designs a random process guaranteed to generate the desired structure if none 
of a set of bad events happen. If those events fall under the above assumption, the Lovasz 
Local Lemma guarantees that the probability that the random process builds the desired 
structure is greater than zero, thereby implying its existence. Often, the probability of this 
good event is exponentially small. Consequently, the same random process cannot directly 
and efficiently find the desired structure. The original proof of the Lovasz local lemma [5] is 
non-constructive in this sense. Starting with the work of [2] , a series of papers pD, HJ [101 HH1 E] 
have sought to make the LLL constructive. We note that in most applications where LLL is 
useful (e.g., El E]), the proof of existence of the desired structure is known only through 
LLL. 

Algorithms for the LLL are often formulated for one of the two model problems: fc-CNF 
and k- uniform hypergraph 2-coloring. Interesting in their own right, these problems seem 
to capture the essence of the LLL in a simple way. Further, algorithms for these problems 
usually also lead to algorithms for more general settings to which LLL applies. For fc-CNF, 
the LLL implies that /c-CNF formulas in which each clause intersects at most 2 k /e other 
clauses, always have a satisfying assignment. The objective is to find such a satisfying 
assignment efficiently 

Recently Moser [12J made the breakthrough by discovering an efficient randomized algo- 
rithm for finding a satisfying assignment for /c-CNF formulas in which each clause intersects 
at most 2 k /32 — 1 other clauses. This is best possible up to a constant factor. Subsequently, 
Moser and Tardos [13J, building upon the work of Moser [12], gave an efficient random- 
ized algorithm that works for the general version of LLL under a very general algorithmic 
framework (discussed below). They also give a randomized parallel algorithm under slightly 
stronger assumptions. They derandomize their sequential algorithm under the strong con- 
dition that the maximum degree in the dependency graph is a constant. We address the 
open question posed by them about derandomizing their algorithm when the degrees of the 
dependency graph are unbounded. 

For fc-CNF, the running time of the deterministic algorithms of Moser [12] and of Moser- 
Tardos [13] behaves like m n ^ k \ In this paper, we extend the work of Moser and of Moser- 
Tardos in several ways. We give a deterministic algorithm with running time 0(m 2( - 1+ ^), for 
/c-CNF formulas with m clauses such that no clause shares variables with more than 2 fc /( 1+£ )/e 
other clauses, where e is any positive constant. We obtain this algorithm as a corollary to 
our deterministic algorithm which works for a general version of LLL in the algorithmic 
framework of Moser and Tardos [13] . We also give deterministic parallel algorithms in this 
framework as long as the events satisfy appropriate complexity assumptions (which seem 
essential to have a handle on the events). 

1.1 Algorithmic Framework 

To get an algorithmic handle on the LLL, some restrictions on the probability space under 
consideration are imposed. In this paper we follow the general algorithmic framework for 
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the local lemma due to Moser-Tardos [13] : Every event A in a finite collection of events 
A = {Ax, ■ ■ ■ ,A m } is determined by a subset S C V of a finite collection of mutually 
independent discrete random variables V = {Pi, ■ ■ ■ , P n }; let Di be the domain of Pi. We 
denote by vbl(A) the minimal subset SCP that determines A. We define the dependency 
graph G = G A for A to be the graph on vertex set A with an edge between events A,B G A 
if A^B but vbl(A) n vbl(P) ^ 0. For A E A we write r(A) = T A (A) for the neighborhood 
of A in G and T + (A) = T(A) U {^4}. Note that events that do not share variables are 
independent. 

We think of A as a family of "bad" events, and the objective is to find a point in the 
probability space, or equivalently an evaluation of the random variables, where none of the 
bad events happen. We call such an evaluation a good evaluation. Moser and Tardos 
[T3] gave a constructive proof of the general version of the local lemma (Theorem [1]) in 
this framework which seems sufficiently general to capture most applications of LLL, using 
Algorithm 1. 

Algorithm 1: Sequential Moser— Tardos Algorithm 

1. For every P G V, v p <— a random evaluation of P. 

2. While 3A G A : A happens when (P = v P : VP G V), do 

(a) Pick one such A that happens. 

(b) Resample(A): For all P G vbl(A), do 

• vp <— a new random evaluation of P; 

3. Return (vp)p e p 

Theorem 1. If there exists an assignment of reals x : A — > (0, 1) sitc/i t/iat 

VA6i: Pr(A) <x'(A) :=x(A) JJ (l-x(B)), (1) 

SGr(A) 

i/ien t/ie probability that none of the events in A happens is at least YIa^a^ ~ > 0- 

The expected number of resamplings done by Algorithm 1 before it finds a good evaluation, 

i-x(A)r 

We work under the framework described in this subsection throughout the paper. 
1.2 Our results 

To state our results we define the following parameters for x(A) and x'(A) as in Theorem [1] 
whenever such an assignment exists. 



D := maxp g -p{domain(P)}. 

h 

~x'(A) l-x(A) 



M := max <j n,4m,4^ 2| ^ }l ■ \ where A = {A G A \ x'(A) > ^}. 



w min := min j4e ^{-loga; / (^)}. Note that w min >l/M. 
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These definitions are used throughout the rest of this paper. 

To obtain the sequential deterministic algorithm for the general version, we need certain 
complexity assumptions about the events. The complexity assumptions regarding the con- 
ditional probabilities in the following theorem are same as the ones used by Moser-Tardos 



Theorem 2. Let the time needed to compute the conditional probability Pr[A\\/i G / : P t = Vj\ 
for any A G A and any partial evaluation (vi G Di)i e i where I C [n] be at most tc- If there 
is an e G (0, 1) and an assignment of reals x : A — > (0, 1) such that: 



It is illuminating to look at the special case of fc-CNF both in the statement of our 
theorems and proofs as many of the technicalities disappear, while retaining the basic ideas. 
For this reason, we will state our results also for fc-CNF. 

Corollary 3. Let F be a k- CNF formula with m clauses such that each clause in F shares 
variables with at most 2~ fe other clauses for some e G (0, 1). Then there is a deterministic 
algorithm that finds a satisfying assignment for F in time 0(m 2 ^ 1+ ^). 

This improves upon the deterministic algorithms of Moser [T2] and of Moser-Tardos 
(specialized to fc-CNF) [13] with running time m n( - k \ which is super polynomial for k = oj{1). 
Our algorithm works in polynomial time under a far wider range of parameters than previous 
algorithms. To the best of our knowledge, the previous best deterministic algorithm was for 
hypergraph 2-coloring: This is the problem of coloring the vertices of a hypergraph with two 
colors so that no hyperedge is monochromatic. A polynomial time deterministic algorithm 
for /c-uniform hypergraphs in which no edge intersects more than 2 fc / 16-2 other edges appears 
in [9|. Our theorem above, applies equally well to the 2-coloring problem to state that a 
deterministic algorithm running in time 0(m 2 ^ 1+ ~- ) ) can give a 2-coloring of a fc-uniform 

hypergraph with m edges in which no edge intersects more than 2~/e other edges. Thus, 
while the previous algorithms for hypergraph 2-coloring run efficiently only in the case when 
e > 15, our algorithm is efficient for any e > 0. 

For the general setting, the deterministic parallel algorithm works under a fairly general 
complexity assumption regarding the events, which is the case for many applications; for 
example, our algorithm applies to fc-CNF and hypergraph 2-coloring. 

Theorem 4. Let the time needed to check if an event A G A happens using processors 
be at most t eva i- Suppose, there is an e G (0, 1) and an assignment of reals x : A — > (0, 1) 
such that: 




1+e 




Then, a deterministic algorithm can find a good evaluation in time 
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If there is a constant c such that every event A e A has decision tree complexity at most 
cmin{— logo;' (A), log M}. Then there is a parallel algorithm that finds a good evaluation in 
time 

/logM 

O I \pMIS + t eva lj 

using processors, where tj^is is the minimum time to compute the maximal inde- 

pendent set in a m-vertex graph using parallel processors on a EREW PRAM. 

Restating our theorem again for fc-CNF, we get the following corollary. 

Corollary 5. Let F be a k- CNF formula with m clauses such that each clause in F shares 
variables with at most 2^ /e other clauses for some e G (0, 1). Then there is a deterministic 
parallel algorithm that finds a satisfying assignment for F in time 0(Mog 3 m) using mP^ 1 /^ 
processors on a EREW PRAM. 



Organization. In the next section we give an intuitive description of the new ideas in 
the paper. We will define the partial witness structure formally in Section 3 and give the 
sequential deterministic algorithm and the proof that it works efficiently in Section 4. Finally, 
we present the parallel algorithm and its running time analysis in Section 5. We end with 
a brief discussion on Lopsided Local Lemma and further work. All missing proofs are given 
in the appendix. 



2 Techniques 

In this section, we informally describe the main ideas in the paper at a high level for the spe- 
cial case of fc-CNF and indicate how they generalize; reading this section is not essential but 
provides intuitive aid for understanding the proofs in this paper. For the sake of exposition 
in this section, we omit numerical constants in some mathematical expressions. Familiarity 
with the Moser-Tardos paper [13] is very useful but not necessary for this section. 

Let F be a /c-CNF formula with m clauses. We note immediately that if k > log m, then 
the probability that a random assignment does not satisfy a clause is l/2 k < l/(2m). Thus 
the probability that on a random assignment, F has an unsatisfied clause is < 1/2, and so 
it can be easily found in polynomial time using the method of conditional probability (see, 
e.g., So, from now on we assume that k < logm. We also assume that each clause in F 
intersects with at most 2 k /e other clauses; thus LLL guarantees the existence of a satisfying 
assignment. 

To explain our techniques we first need to outline the deterministic algorithm of Moser 
and of Moser-Tardos which work in polynomial time, though only for k = 0(1). Consider a 
table T of values of the random variables: for each variable in V the table has a list of values 
for that variable. Now, we can run Algorithm 1 described above using T: instead of randomly 
sampling afresh each time the value of a random variable is needed, we pick its next unused 
value from T. The fact that the randomized algorithm terminates quickly in expectation 
(Theorem [1]), implies that there are small tables on which the algorithm terminates with a 
satisfying assignment. The deterministic algorithm finds one such table. 

1 Informally, a function / having decision tree complexity at most k means that we can determine its 
evaluation f(x) by adaptively querying at at most k components of x. 
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The constraints such a table has to satisfy can be described in terms of witness trees: For 
a run of the randomized algorithm, whenever an event is resampled, a witness tree "records" 
the sequence of resamplings that lead to the current resampling. We say that a witness 
(we will often just say "witness" instead of "witness tree") is consistent with a table, if this 
witness arises when the table is used to run the randomized algorithm. If the algorithm runs 
for a long time, then it has a large consistent witness certifying this fact. Thus if we use a 
table which has no large consistent witness, the algorithm terminates quickly. 

The deterministic algorithms of Moser and of Moser-Tardos compute a list L of witness 
trees satisfying several properties: (1) If no tree in L is consistent with a table, then there 
is no large witness tree consistent with the table. (2) The expected number of trees in L 
consistent with a random table is less than 1. We need this property to apply the method 
of conditional probabilities. (3) For the method of conditional probabilities to be efficient, 
we need the list L to be polynomial in size. 

We now motivate how these properties arise naturally while using Algorithm 1. In the 
context of fc-CNF formulas with m clauses satisfying the degree bound, Moser (and also 
Moser-Tardos when their general algorithm is interpreted for fc-CNF) prove two lemmas 
which they use for derandomization. The expectation lemma states that the expected number 
of large consistent witness trees of size at least logm is less than 1/2 (here randomness is 
over the choice of the table). We could try to use the method of conditional probability to 
find a table such that there are no large witness trees consistent with it. However there are 
infinitely many of them. This difficulty is resolved by the range lemma which states that if 
for some u, no witness tree with size in the range [u, ku] is consistent with a table, then no 
witness tree of size at least u is consistent with the table. Now we can find the required table 
by the method of conditional probability to exclude all tables with a consistent witness of 
size in the range [logm, k logm]. The number of witnesses of size in this range is m n ^ k \ To 
run the method of conditional probability we need to maintain a list of all these witnesses, 
and find the values in the table so that none of the witnesses in the list remain consistent 
with it. So, the algorithm of Moser (and respectively Moser-Tardos) works in polynomial 
time only for constant k. 

One possible approach to resolve the issue of large sized list would be to find a way to 
implicitly maintain the list and carry out the method of conditional probability efficiently 
even though the list size is large. We have not succeeded in this. However we are able to 
reduce the size of this list using two new ingredients: 

Partial Witness Trees. For a run of the Moser-Tardos randomized algorithm, for each 
time instant of resampling of an event, we get one witness tree consistent with the input 
table. Given one consistent witness tree of size ku + 1 removing the root gives rise to up to 
k new consistent witnesses, whose union is the original witness minus the root. Clearly one 
of these new subtrees has size at least u. This proves their range lemma. The range lemma 
is optimal for the witness trees: for a given u it is not possible to reduce the multiplicative 
factor of k between the two endpoints of the range [u, ku]. 

We overcome this limitation by introducing partial witness trees, which have similar 
properties as the witness trees, but have the additional advantage of allowing a tighter range 
lemma. The only difference between witness trees and partial witness trees is that the root 
instead of being labeled by a clause C (as is the case for witness trees) , is labeled by a subset 
of variables from C. Now, instead of removing the root to construct new witness trees as in 
the proof of the Moser-Tardos range lemma, each subset of the set labeling the root gives a 
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new consistent partial witness tree. This flexibility allows us to prove the range lemma for 
the tighter range [u, 2u\. The number of partial witness trees is larger than the number of 
witness trees because there are 2 k m choices for the label of the root (as opposed to m choices 
in the case of witnesses) since the root may be labeled by any subset of variables in a clause. 
But 2 k < m since as explained at the beginning of this section, we may assume without loss 
of generality that k < logm. So the number of partial witnesses is not much larger and the 
expectation lemma holds with similar parameters for partial witnesses as well. The method 
of conditional probability now needs to handle partial witness trees with size in the range 
[logm, 2 logm]. The number of partial witnesses in this range is m n ( k \ which is still too 
large. The next ingredient brings this number down to a manageable size. 



e-slack. By introducing e-slack, that is to say, by making the stronger assumption that each 
clause intersects at most 2 <yl ~ e ^ k /e other clauses, we can prove a stronger expectation lemma: 
The expected number of partial witnesses of size more than (logm)/(ek) is less than 1/2. 
We use the fact that the number of labeled trees of size u and degree at most d is less than 
(ed) u < 2^ 1 ~ £ ^ ku . Thus number of partial witnesses of size u is less than 2 k m2^~^ ku , where 
the factor 2 k m < m 2 accounts for the number of possible labels for the root. Moreover, the 
probability that a given partial witness of size u is consistent with a random table is 2~ fe( "~ 1 ) 
as opposed to 2~ ku in the case of a witness tree; this is proved in a similar manner as for 

/ logm 

witness trees. Thus the expected number of partial witnesses of size at least 7 = 



consistent with a random table is 



ek 



< ^m 2 2 (1 - £)feM • 2~ fe( "- 1) < ^m 3 2~ efal < 1/2 

Now, by the new expectation and range lemmas we can reduce the size range to 
[(log m)/ek, 2 (log m)/ek}. The number of partial witnesses in this range is polynomial in 
m; thus the list of trees that the method of conditional probability needs to maintain has 
polynomial size. 



General Version. For the general version, the root of a partial witness, instead of being 
labeled by a subset of the variables of an event, is labeled by a subset of the neighbors of 
that event, and all children of the root must have labels from this subset. This leads to a 
large number of possible labels for the root. However, for the range lemma to hold true it 
suffices if we restrict the root of a partial witness to have as label, only a small number of 
certain chosen subsets of the neighbors. This choice of subset labels is given by a binary tree 
Ma for event A. 

Another ingredient of the algorithm in the general setting is the notion of weight of a 
witness tree. The weight of a tree is the sum of the weights of individual vertices; more 
weight is given to those vertices whose corresponding bad events have smaller probability of 
occurrence. 

Our deterministic algorithm for the general version also finds a list L satisfying the three 
properties mentioned earlier; however as opposed to size, the weight becomes the parameter 
of concern: Our list L consists of partial witnesses with weight (as opposed to size) in the 
range [7, 27], where 7 is a number depending on the problem. If we define L using a range 
for the sizes instead of weights, no list with the properties mentioned above may exist. 
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Parallel Algorithm. For the parallel algorithm, we use the technique of /c-wise (^-dependent 
probability spaces due to Naor-Naor[T^] and its extensions in the later work [6], [3] . This tool 
was also applied by Alon p] to give deterministic parallel algorithm for /c-uniform hyper- 
graph 2-coloring and other applications of the LLL, but with much worse parameters than 
ours. Our application of this tool is quite different from the way in which Alon uses this: 
Alon starts with a random 2-coloring of the hypergraph chosen from a small size limited in- 
dependence space; at least one of the sample points in this space has the property (roughly 
speaking) that the almost monochromatic hyperedges form small connected components. 
For such an assignment, one can alter it locally over vertices in each component to get a 
good 2-coloring. 

However, we do the following: recall that for a random table the expected number of 
consistent partial witness with size in the range [(\ogm)/(ek), 2(logm)/(e/c)] is at most 1/2. 
Each of these partial witness uses at most 21 °| m • k variables. Now, instead of using a 
completely random table, we use a table chosen according to a 21 ° e gm -wise independent 
distribution (i.e., any subset of at most 21ogw has the same joint distribution as in the 
original random table). So the partial witness tree is consistent with the new random table 
with the same probability as before. And hence the expected number of consistent partial 
witness trees with the new random table is still at most 1/2. But now the key point to note 
is that the number of tables in the new limited independence distribution is much smaller 
and we can try each of them in parallel till we succeed with one of the tables. To make the 
probability space even smaller we use /c-wise 5-dependent distributions, but the idea remains 
the same. Finally, to determine whether a table has no consistent heavy partial witness we 
run the parallel algorithm of Moser-Tardos on it. 

The above strategy requires that the number of variables on which witnesses depend be 
small, and hence the number of variables on which events depend should also be small. In 
our general parallel algorithm we relax this to some extent: instead of requiring that each 
event depend on few variables, we require that the decision tree complexity of the event is 
small. The proof goes through in the same way as above. 

3 The Partial Witness Structure 

In this section we define partial witness structure and prove a range lemma using weights. 

For every A G A we fix a rooted binary tree B^. All vertices in B^ have labels which 
are nonempty subsets of vbl(v4): The root of B^ is labeled by vbl(v4) itself, the leaves are 
labeled by singleton subsets of vbl(A) and every non-leaf vertex in B^ is labeled by the 
disjoint union of the labels of its two children. This means that every non-root non-leaf 
vertex is labeled by a set {v^, ■ ■ ■ , Vi k }, k > 2 while its children are labeled by {v^, ■ ■ ■ ,V{.} 
and {vi j+1 , ■ ■ ■ ,v ik } for some 1 < j < k — 1. Note that B^ consists of at most 2|vbl(A)| — 1 
vertices. We abuse the notation B^ to denote the labels of the vertices of this binary tree 
which are actually subsets of vbl(v4). This tree is not to be confused with the witness tree. 
The elements from B^ will be used to define the roots of the partial witness trees. 

Remark 1: (Regarding definition of the binary tree B^) For the proof of the range 
lemma it is sufficient to assume that the leaf-subsets of B^ induce cliques in the dependency 
graph. This is true when the leafs are singleton subsets. Using a minimum clique cover 
of the neighborhood as a leaf-set can significantly reduce the size of B^. We did not state 
this in Section [L2l for the sake of simplicity, but the smaller \Ma\ can replace the 2|vbl(yl)| 
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factor in the definition of M, reducing the running time bounds of our algorithms. This is 
for example necessary to obtain the running time bounds stated in Corollary [3] for fc-CNF. 
In the case of fc-CNF the neighborhood vbl(A) of a clause A is naturally covered by at most 
k cliques, which arise from all the clauses that share a common variable p G vbl(A). Thus 
using a minimum clique cover gives |B^| < 2k for fc-CNF, saving a factor of 0(m) in the 
running time. 

A partial witness tree r$ is a finite rooted tree whose vertices apart from the root are 
labeled by events while the root is labeled by some S G for some A G A. The children 
of the root receive labels from the set of events which depend on any of the variables in S; 
the children of every other vertex labeled by B, receive labels from T + (B). For notational 
convenience, we use V(rg) := V{ts) \ {Root(rs)} and denote the label of a vertex v G V(ts) 
by [v]. 

A full witness tree is a special case of a partial witness where the root is the complete 
set vbl(v4) for some A G A. In such a case, we relable the root with A instead of vbl(v4). 
Note that this definition of full witness tree matches the one of witness trees in [13]. 

Define the weight of an event A G A to be w(A) = — \ogx'(A). Define the weight of 
a partial witness tree t$ as the sum of the weights of the labels of the vertices in V(ts), 
i.e., 

w ( r s) ■= yi MM)=- io g n 

v£V(t s ) v€V(t s ) 

The depth of a vertex in a witness tree is the distance of that vertex from the root in 
the witness tree. We say that a partial witness tree is proper if all children of a vertex have 
distinct labels. 

Similar to [IT] , we will control the randomness using a table of evaluations. Let us 
denote this table by T. This table contains a row for each variable. The row for each 
variable contains evaluations for the variable. Note that the number of columns in the table 
could possibly be infinite. In order to use such a table in the algorithm, we maintain a 
pointer t p for each variable p G V indicating the column containing its current value used 
in the evaluation of the events. We denote the value of p at t p by T(p,t p ). If we want to 
resample for a variable, we just increment this pointer by one. 

It is clear that running the randomized algorithm is equivalent to picking a table with 
random values and using such a table to run the algorithm. We call a table T a random 
table if, for all variables p G V and all positions j, the entry T(p,j) is picked independently 
randomly according to the distribution of p. 

Algorithm 2: Moser-Tardos Algorithm with input table 

Input: Table T with values for variables. 

Output: An assignment of values for variables which makes none of the events in A happen. 

1. For every variable p G V: Initiate the pointer t p = 1. 

2. While 3A G A that happens when (VP G V : p = T(p, t p ), do: 

(a) Pick one such A. 

(b) Resample(v4): For all P G vbl(yl) increment t p by one 

3. Return p = T(p, t p ) : Vp G V 
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In the above algorithm, Step 2(a) is performed by a fixed arbitrary deterministic proce- 
dure. This makes the algorithm well-defined. 

Let C : N — > A be an ordering of the events, which we call the event-log. Let the ordering 
of the events as they have been selected for resampling in the execution of Algorithm 2 using 
a table T be denoted by an event-log Ct- Note that C is partial if the algorithm terminates 
in finite time. 

Given event-log C, associate with each step t and each S G Matt), a partial witness 

tree Tc(t,S) as follows. Define Tq (t,S) to be an isolated root vertex labeled S. Going 
backwards through the event-log, for each % = t — 1, t — 2, • ■ ■ , 1: (i) if there is a non-root 
vertex v G T^ +1 \t, S) such that C(i) G r + ([t>]), then choose among all such vertices the one 
having the maximum distance from the root (break ties arbitrarily) and attach a new child 
vertex u to v with label C(i), thereby obtaining the tree Tq (i, S), (ii) else if S H vbl(C(i)) 
is non-empty, then attach a new child vertex to the root with label C(i) to obtain Tq (t, S), 
(iii) else, set T$(t,S) = T^ +l) (t,S). 

Note that the witness tree Tc(t,S) is partial unless S = T + (C(t)), in which case the 
witness tree is identified with a full witness tree. For such a full witness tree, our construction 
matches the construction of witness trees associated with the log in [13J. 

We say that the partial witness tree Ts occurs in event-log C if there exists t e N such 
that for some A e A, C(t) = A and r s = r c (t, S) for some S G B^. 

For a table T, a T-check on a partial witness tree t$ uses table T as follows: In an 
order of decreasing depth, visit the non-root vertices of r$ and for a vertex with label A, 
take the first unused value from T and check if the resulting evaluation makes A happen. 
The T-check passes if all events corresponding to vertices apart from the root, happen when 
checked. We say that a partial witness tree is consistent with a table T if T-check passes 
on the partial witness tree. 

Lemma 6. If a partial witness tree r s occurs in the event-log Ct, then 

1. Ts is proper. 

2. Ts passes the T-check. 

Proof. The proof of this lemma is similar to the one in [T3] . 

Since t s occurs in Ct, there exists some time instant t such that for S G M CT (t), t s = 
Tc T (t, S). Let d(v) denote the depth of vertex v G V(rg) and let q(v) denote the time instant 
of the algorithm constructing Tc T (t, S) in which v was attached, that is, q(v) is the largest 
value q with v contained in T^\t). 

If q(u) < q(v) for vertices u, v G V(ts) and vbl([w]) and vbl([f]) are not disjoint, then 
d(u) > d(v). Indeed, when adding the vertex u to T^ u ' +1 \t) we attach it to v or to another 
vertex of equal or greater depth. Therefore, for any two vertices u, v G V(ts) at the same 
depth d{y) = d(u), [u] and [v] do not depend on any common variables, that is the labels in 
every level of Ts form an independent set in G. In particular Ts must be proper. 

Now consider a vertex v in the partial witness tree rg. Let S(p) be the set of vertices 
w G t s with depth greater than that of v such that [w] depends on variable p. 

When T-check considers the vertex v labeled by B and uses the next unused evaluation 
of the variable P, it uses the evaluation T(p, \ S(p)\). This is because the witness check visits 
the vertices in order of decreasing depth and among the vertices with depth equal to that 
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of v, only [v] depends on p and so before the witness-check considers v it must have used 
values for p exactly when it was considering the vertices in S(p). 

At the time instant of resampling [v], say t v , the algorithm chooses [v] to be resampled 
which implies that [v] happens before this resampling. For p G vbl([t>]), the value of the 
variable p at t v is T(p, \S(p)\). This is because, the pointer for P was increased for events [w] 
which were resampled before the current instance, where w G S(p). Note that every event 
which was resampled before t v and which depend on [v] would be present at depth greater 
than that of v in rg by construction. Hence, S(p) is the complete set of events which led to 
resampling of p before the instant t v . 

As the T-check uses the same values for the variables in vbl([i>]) when considering v, it 
also must find that [v] happens. □ 

Next, we state the important lemma used for derandomizing efficiently. 

Lemma 7. If a partial witness tree of weight at least 7 occurs in the event-log Ct such that 
x '{[ v \) > l/4m for every non-root vertex v, then a partial witness of weight G [7,27] occurs 
in the event-log Ct- 

Proof. Suppose not. Then, consider the least weight partial witness tree whose weight is at 
least 7 which occurs in the event-log Ct, namely rg = Tc T (t,S) for some t, S G B^. By 
assumption, w(t s ) > 27. We have two cases: 

Case (i): Root (rg) has only one child. Let this child be labeled by v. Let t' be the 
largest instant before t at which [v] was resampled. Now, consider the partial witness tree 
t' s = Tc T (t',S') = T + ([v]) (it is a partial witness tree since S' G Br„i). Since t' s contains 
one less vertex than rg, w{r' s ) < w(r s ). Also, since x'(A) > l/4m, we have log (l/x'(A)) < 
log (4m) < l ^sM = 7 (by the parameters defined in section 1.2) we have that w(t' s ) = 
w(t s ) — log (l/x'(A)) > 7. Finally, by definition of r 5 , it is clear that t' s occurs in the 
event-log Ct- Thus, r' s is a counterexample of smaller weight contradicting our choice of rg. 

Case (ii): Root(rg) has at least two children. This means that S = {A^,--- ,Ai k } for 
some k > 2. In B^, starting from S, we now explore the children of S in the following 
way, looking for the first vertex whose children S L and S R reduce the weight of the tree, 
i.e., < w(t Sl ),w(t Sr ) < w(t s ), where t Sl = r CT (t,S L ) and t Sr = T CT (t,S R ): if a vertex 
Sl reduces the weight of the tree without making it zero (i.e., < w(ts l ) < w(ts)), then 
its variable disjoint sibling Sr must also reduce the weight of the tree; on the other hand 
if a vertex Sl reduces the weight of the tree to zero, then its sibling Sr can not reduce the 
weight of the tree. Suppose Sl reduces the weight to zero, then we explore Sr to check if 
its children reduce the weight. Lastly the weight has to reduce while considering a vertex 
which is a parent of leaves. 

By definition, both rg L and t$ r occur in the event-log Ct- Since we pick the first siblings 
Sl and Sr (in the breadth first search) which reduce the weight, their parent S' is such that 
w(t~S') > w(t~s), where rg/ = T CT (t,S'). We are considering only those S' such that S' C S 
which implies that w(ts>) < w(t s ). Hence, w(ts') = uj(t s ) and for every vertex with label 
A in rg, one can find a unique vertex labeled by A in rg/ and vice- versa. Further, S' is the 
disjoint union of Sl and Sr] therefore, for vertex with label A in rg/, one can find a unique 
vertex labeled by A either in rg L or rg fl . 

As a consequence, we have that for every vertex with label A in rg, one can find a unique 
vertex labeled by A either in t Sl or t Sr . Hence, w(t Sl ) + w(t Sr ) > w(r s ) and therefore, 
max{ w (ts l ), w(t Sr )} > w{r s )/2 > 7. So, if we consider the witness among rg L and t Sr with 
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larger weight, it is a counterexample of weight at least 7 but of weight less than that of ts, 
contradicting our choice of T5. □ 



4 Deterministic Algorithm 

In this section we describe a sequential deterministic algorithm and prove Theorem [2j 

For the rest of the paper we define a set of forbidden witnesses F which contains all 
full witnesses consisting of single events of large weight and all partial witness trees of weight 
between 7 and 27 consisting only of small weighted events. More formally, F := F\ U F2 
where 

• Fx := {t = v I [v] E A \ A} and 

• F 2 := {r I w(t) E [7, 27] A Vt> G r, [v] E A A (Root(r) E M A for some A E A }. 

Recall that A = {A E A \ x'(A) > With these definitions we can state our determin- 
istic algorithm. 

Algorithm 3: Sequential Deterministic Algorithm 

1. Enumerate all forbidden witnesses in F. 

2. Construct table T by the method of conditional probability: For each variable p EV, 
and for each j , < j < — ^— , do 

• Select a value for T(p,j) that minimizes the expected number of forbidden wit- 
nesses that can occur in the event-log given that all values chosen so far and 
T(p,j) are fixed and the yet uncomputed values are random. 

3. Run Algorithm 2 using table T as input. 

For the Algorithm 3 to work efficiently, we need that the table T can be constructed 
by the method of conditional probability quickly. For this, we prove that the number of 
forbidden witnesses is polynomial using Lemma [9j Further, we prove that the table given 
by the method of conditional probability is a good table; that is, the execution of the 
sequential algorithm using this table terminates within polynomial number of steps. We 
show this as follows: Lemma [H] proves that the number of forbidden witnesses that occur 
in the event-log by using this table is zero (due to the invariant maintained by the method 
of conditional probability while obtaining the table). Therefore, the sequential algorithm 
does not encounter any of the forbidden witnesses. Putting this together with Lemma U\ we 
can show that the maximum weight of any partial witness tree occurring in the event-log 
produced by the execution of the sequential algorithm with any fixed order to resample is 
at most 7. Finally, the maximum number of vertices in a partial witness tree of weight at 
most 7 is not too large. Hence, the maximum number of times any event is resampled while 
running the sequential algorithm using table T is not too large. 

Lemma 8. The expected number of forbidden witnesses occurring in the event-log Ct, where 
T is a random table, is less than \ . 
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Proof. We first prove that the expected number of witnesses r E F 2 occurring in Ct is at 
most |: For each event A £ A, let and be the set of partial and respectively full 
witness trees in F 2 with root from Ma- With this notation the expectation in question is 
exactly: 

E := Pr (r occurs in the event-log Ct) 

Note that according to Lemma [6j a partial witness tree occurs in the event-log Ct only 
if it passes the T-check. Clearly, the probability that a witness r passes the T-check for 
the random table T is riueF(T) P r (M)- Using this and the assumption in Theorem [2] that 
Pr(M) < x'([v]) 1+e we get 

£<ee n *'(M) ] 

AeA T ^ T A v£V(t) 



Note that each full witness tree with root A £ A can give rise to at most |B^| partial witness 
trees since the possible subsets that we consider are only the sets which occur as labels of 
vertices in the binary tree B^. Hence, we can rewrite to get that the expectation is 

e<j2_\ m *\ Yi n *'(M) i+e 

AeA reT' AV (zv(T) 



AeA reV A \vev{T) 



7e 



where the last expression follows because, for r £ T^, we have: 

w(t) = - log Yl x '([ v \) > 7 



Since every partial witness tree that occurs in the event-log is proper (LemmaEf, by using 
the Galton- Watson process, similar to the summation in [13J, we get that the expectation is 

^eM(e n *>»V 

AeA v ; \rer' A veV(T) ) 




where the last inequality follows since |B^| < 2|vbl(A)| and using 7 = — . Hence, the 
expected number of witnesses r £ F 2 occurring in Ct is at most j. 
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To calculate the expected number of forbidden witnesses in F = F\ U F 2 that occur in 
Ct we use the linearity of expectation to obtain the desired result: 



EPr (r occurs in Ct) = Pr (r occurs in Ct) H — < m 1 — = - 
v ' ' 4 4m 4 2 



□ 



Owing to the definition of forbidden witnesses via weights, there is an easy way to count 
the number of witnesses using the fact that their expected number is small. 

Lemma 9. The number of forbidden witnesses \F\ is less than M 2 ( 1+1 ' € \ 

Proof. We first count the number forbidden witnesses in F 2 . Each of them have weight 
w(t) < 27 and thus: 



t£F 2 t£F 2 \v£V{t) 

where the last inequality comes directly from the proof of Lemma [SJ Therefore \F 2 \ < 
I 2 27(i+^) = 1^2(1+1/6) and the tQtal number Q f forbidden witnesses is |F| = |Fi| + \F 2 \ < 
m+ i M 2 ( 1+1 A) < m 2 ( 1+1 A). □ 

We are ready to prove Theorem [2j 

Proof of Theorem The first step of the deterministic algorithm enumerates all forbidden 
witnesses in F. By Lemma [91 there are at most M 2( - 1+1 /^ forbidden witnesses and it is easy 
to enumerate them using a dynamic programming approach. 




The running time to find the suitable table T using the method of conditional probabilities 
can be computed as follows: each variable has at most D possible choices of values; for each 
value we compute the expected number of forbidden witnesses that can occur in the event- 
log by incrementally computing the probability of each forbidden witness in F to occur and 
keeping track of the sum of these probabilities. Further, each forbidden witness r e F 2 has 
weight at most 27 and thus each forbidden witness consists of at most k = ( — — h 1 ) = 

2 log AI + 1 vertices. Hence, the time to compute T is at most 

0(D -\F\-k-t c ) = ( CM2 " +1/ " l0sM tc ) = 6 (DM»h c ) 

since w m i n > 1/M. To complete the proof we show that the running time of the sequen- 
tial algorithm on a table T obtained by Step 2 of the deterministic algorithm is at most 

First, note that by running the sequential algorithm using the table T, none of the 
forbidden witnesses can occur in the event-log Ct- This is because the table is obtained by 
the method of conditional probabilities: In the beginning of the construction of the table, 
when no value is fixed, the expected number of forbidden witnesses that occur in the event- 
log is less than 1/2 as proved in Lemma [HJ this invariant is maintained while picking values 
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for variables in the table; thus, once all values are fixed the number of witness trees in F 
that occur in the event-log Ct is still less than 1/2 and hence zero. This implies that none 
of the events A G A \ A are resampled while using the table T since these events are labels 
of forbidden witnesses in It also guarantees that the sequential algorithm with T as 
input resamples each event A G A at most k times. This is because, if some event A G A is 
resampled more than k times, then, A occurs in the event-log Ct at least k times. Now, the 
weight of the partial witness tree associated with the last instance at which A was resampled, 
would weigh at least kw m i n which is more than 27, a contradiction to Lemma [7J Therefore, 
the running time for the sequential algorithm using the table T is O (m • k ■ to) which is 
smaller than the upper bound for the time needed to find the good table T. □ 

To obtain the corollary regarding fc-CNF, we use the optimum values for the parameters 
used to obtain the deterministic algorithm for the general version. 

Proof of Corollary^ For a fc-CNF formula with clauses A = {Ax, ■ ■ ■ , A m }, for each clause 
A G A we define an event A and say that the event happens if the clause is unsatisfied. 
Further, each variable appearing in the formula picks values uniformly at random from 
{0, 1}. Then, for every event A, Pr (A) = 2~ k . We assume that k = O(logm) for otherwise 
the problem becomes simple. If d is the maximum number of clauses that a clause shares 
its variables with, setting x(A) = 1/d \/A G A, we obtain that x'(A) > j-. The condition 
that d < 2 fe/(1+e) /e can be translated to Pr (A) < x'(A) 1+€ V71 G A. Therefore, we use 
parameters tc = 0(k), w m i n — 0(k), D — 2 and a slightly optimized parameter of M with 
T + (A) replaced by k (see the Remark at the beginning of Appendix Section 7.1), to obtain 
M = 0(mk) = O(mlogm). With these parameters the following corollary follows directly 
from Theorem [2J □ 

5 Parallel Algorithm 

We need the definition of (k, <5)-approximate distribution to describe our algorithm. 

Definition 10. (k, 5)- approximations^ : Let T> be the probability distribution on Di x D 2 x 
. . . x D n given by the random variables in V. For positive integer k and constant 5 G (0, 1) 
a probability distribution y on D\ x D 2 x . . . x D n is said to be a (k, 5) -approximation ofT> 
if the following holds. For every I C [n] such that \I\ < k, and every v E Di x D 2 x . . . x D n 
we have 

|PrM -PrMI < 5, 
v y 

where Ptx>[vj] denotes the probability that for a random vector (xi, . . . ,x n ) with probability 
distribution T> we get x,- t = v,i for i G I and Viy[vi\ is defined similarly. 

Algorithm 4: Parallel Deterministic Algorithm 

1. Take Y as the domain of a (k, (^-approximation y for the probability space of random 
tables. 

2. For each table T G Y do in parallel: 

(a) For every variable p G V: Initiate the pointer t p — 1. 
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(b) While 3AeA that happens when (VP G V : p = T(p, t p ), do: 

• Compute a maximal independent subset / of those clauses. 

• Resample(A): For all P G (J Ag/ vbl(y4) increment t p by one. 

3. Once a valid assignment is found using one of the tables, output it and terminate. 

As in the sequential algorithm, the problem reduces to finding a table on which the above 
algorithm terminates quickly. Our algorithm relies on the following observation: instead of 
sampling the values in the table independently at random, if we choose it from a distribution 
that is a (k, ^-approximation of the original distribution (for appropriate k and 5), the 
algorithm behaves as if the values in the table had been chosen independently at random 
(Proposition [H]). The support of a (k, 5) distribution is polynomially small, and so this 
gives us a polynomial sized set of tables which is guaranteed to contain at least one table 
on which the algorithm terminates quickly (Corollary [T2]) . Our algorithm runs the Moser- 
Tardos parallel algorithm on each of these tables in parallel, and stops as soon as one of the 
tables lead to the good evaluation. 

Proposition 11. Let S = D\ X . . . x D n be a product space of finite domains of size at most 
D = maxj \Di\, V be an independent product distribution on S and f, fi, f2'-S^ {0, 1} be 
boolean functions on S. 

• If f\ and fi have decision tree complexity k\ and k 2 respectively, then the decision tree 
complexity of f\ A f 2 is at most k\ + k 2 . 

• If f has decision tree complexity at most k then every (k, 5) -approximation y of V is 
D k d~ -indistinguishable from V , i.e.: 

\E y (f) - E v (f)\ < D k 6 

Proof. For the first claim we recall that a function having decision tree complexity at most 
k means exactly that we can determine its evaluation f(x) for x G S by adaptively querying 
at at most k components of x. If this is true for f\ and f'2 with k\ and k 2 respectively then 
we can easily evaluate fi(x) A f2(x) by adaptively querying at most k\ + k 2 components of 
x. Therefore this conjunction has decision tree complexity at most k± + k 2 . 

For the second claim, we fix a decision tree of / with depth at most k. Each of the 
leaf-to-root paths in this tree corresponds to a partial assignment of values to at most k 
components which already determines the result of /. The expectation of / under any 
distribution is simply the sum of the probabilities of the paths resulting in a 1-evaluation 
at the leaf. Switching from a completely independent distribution to a fc-wise independent 
distribution does not change these probabilities since the partial assignments involves at most 
k variables. Similarly switching to a (k, (^-approximation changes each of these probabilities 
by at most S. There are at most D k paths resulting in a 1-evaluation which implies that the 
deviation of the expectation is at most D k 5. □ 

Remark 2: Instead of assuming for each event A, a decision tree complexity of at most 
k(A) = 0(min{logM, — \ogx'(A)}) it suffices to demand a less strong assumption that the 
boolean function for each event A can be expressed by a polynomial of degree at most k(A) 
with coefficients and number of terms bounded by 2 k ( A \ In general any property that fulfills 
the above "additivity" and implies the property that any (k, ^-approximation is 2°^S- 
indistinguishable suffices. We do not know whether this property itself behaves additively. 
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Corollary 12. The expected number of forbidden witnesses consistent with a table T that 
was created by a (k, 5) -approximation for distribution of random tables with k = 2cy and 
(T 1 = 3M 2+2 / e £> 2c7 is at most 1/2 + 1/3 < 1. 

Proof. The event that a partial witness r £ F 2 is consistent with T is exactly the conjunction 
of events [v], v £ V(r); using Proposition [Til the decision tree complexity of this tree is at 
most ^2 ve v( T ) cmrn { l°g M, — logx 1 ([v])}. Also, if r £ F 1; then clearly the decision tree 
complexity of r is at most clogM < cy; else if r £ F 2 , then 

cmin{logM, — logx'([f])} < c — logx'Qu]) < 2cry. 

Lemma [8] shows that using the original independent distribution V, the expected number 
of forbidden witnesses occurring is at most 1/2. The second claim of Proposition [TT1 proves 
that switching to a (k, ^-approximation changes this expectation by at most D k 5 = 3M2 1 +2/ e 
for each of the \F\ witnesses. To complete the proof, observe that by Lemma [9] we have 
|F| < M 2+2 ' e . □ 

We obtain the proof of Theorem [4] along the outline mentioned in the beginning of the 
section. 

Proof of Theorem [^} We use Algorithm 4 to obtain the good evaluation. Corollary [12] guar- 
antees that there is a table T £ Y for which there is no forbidden witness consistent with it. 
Steps l2"aH3l are exactly the parallel algorithm of Moser and Tardos [T3] and by using Lemma 
4.1 of [13], if this algorithm runs for % iterations then there exists a consistent witness of 
height i. Such a witness has weight at least i x u> m m and thus i < — ^— . Each of these % 
iterations takes time t eva i to evaluate all m clauses and time tMis to compute the indepen- 
dent set on the induced dependency subgraph of size at most m. This proves that after 
creating the probability space y the algorithm terminates in Oi-p—itMis + tevai)) time and 
the termination criterion guarantees a valid outcome. The number of processors needed for 
the loop is polynomial bounded by for each of the \Y\ parallel computations. 

It can be shown that Y can be constructed efficiently in parallel and that it is at 
most polynomially large. For this, we use the construction described in [B] (which in turn 
uses a construction in [T4J). This construction builds a (k, ^-approximation to a prod- 
uct space with maximum domain size D and \T\ variables. Such a space has size \Y\ = 
poly(2 fc ,log|T|,5- 1 ) = M°(? lo s D ) and can be constructed in parallel in time poly (log k + 
log log | T | + loglog<5 -1 ) = O(logM). The construction described in [6] can be parallelized 
with the required parameters easily. □ 



6 Discussion 



Moser and Tardos [13] raised the open question for a deterministic LLL algorithm. We ad- 
dress this and give a deterministic algorithm that works under nearly the same conditions as 
its randomized versions. All known deterministic or (randomized) parallel algorithms need a 
slack in the LLL conditions. Whether our e-slack can be removed remains open. Similar to 
Moser- Tardos, our sequential algorithm needs the assumption that exact conditional proba- 
bilities can be computed efficiently These assumptions seem to hold for many applications 
and exact computations of probabilities can be performed reasonably efficiently. It is still 



16 



an interesting question how much room is there for approximate computations and whether 
those assumptions are needed. The complexity assumptions of our parallel algorithm are 
alternative assumptions which may also be weakened (see Remark 2). 

We have an improved parallel algorithm that determines a valid assignment directly 
from a good table needing only polynomially many non-adaptive evaluations. While this is 
a surprising result on its own, it also speeds up the running time. Nevertheless one maximal 
independent set needs to be computed and it is an interesting question whether there is a 
parallel algorithm that avoids this. 

This parallel algorithm is of interest because it extends to the Lopsided LLL, which uses 
a weaker notion of event-dependency. Moser-Tardos proved that their sequential algorithm 
extends to this case but their parallel algorithm does not. The same is true for our sequential 
and parallel algorithm. The improved parallel algorithm overcomes this and is the first 
parallel algorithm for the lopsided LLL. 

References 

[1] N. Alon. A Parallel Algorithmic Version of the Local Lemma. Random Struct. Algo- 
rithms, 2(4):367-378, 1991. 

[2] J. Beck. An Algorithmic Approach to the Lovasz Local Lemma. Random Struct. 
Algorithms, 2(4):343-366, 1991. 

[3] S. Chari, P. Rohatgi, and A. Srinivasan. Improved Algorithms via Approximations of 
Probability Distributions. J. Comput. Syst. Sci., 61(1):81-107, 2000. 

[4] A. Czumaj and C. Scheideler. A new algorithm approach to the general Lovasz local 
lemma with applications to scheduling and satisfiability problems (extended abstract). 
In STOC, pages 38-47, 2000. 

[5] P. Erdos and L. Lovasz. Problems and results on 3-chormatic hypergraphs and some 
related questions. In A. Hajnal, R. Rado and V.T. S6s, editors, Infinite and Finite Sets 
(Colloq., Keszthely, 1973; dedicated to P. Erdos on his 60th birthday), volume 2, pages 
609-627, 1975. 

[6] G. Even, O. Goldreich, M. Luby, N. Nisan, and B. Velickovic. Efficient approximation 
of product distributions. Random Struct. Algorithms, 13(1):1— 16, 1998. 

[7] U. Feige. On allocations that maximize fairness. In SODA '08: Proceedings of the nine- 
teenth annual ACM-SIAM Symposium on Discrete Algorithms, pages 287-293, Philadel- 
phia, PA, USA, 2008. Society for Industrial and Applied Mathematics. 

[8] F. T. Leighton, B. M. Maggs, and S. Rao. Packet Routing and Job-Shop Scheduling in 
(Congestion + Dilation) Steps. Combinatorica, 14(2):167-186, 1994. 

[9] M. Molloy and B. Reed. Graph Colouring and the Probabilistic Method. Springer, 2000. 

[10] M. Molloy and B. A. Reed. Further Algorithmic Aspects of the Local Lemma. In STOC, 
pages 524-529, 1998. 



17 



[11] R. A. Moser. Derandomizing the Lovasz Local Lemma more effectively. CoRR, 
abs/0807.2120, 2008. 



[12] R. A. Moser. A constructive proof of the Lovasz local lemma. In STOC '09: Proceedings 
of the 41st annual ACM Symposium on Theory of Computing, pages 343-350, New York, 
NY, USA, 2009. ACM. 

[13] R. A. Moser and G. Tardos. A constructive proof of the general Lovasz Local Lemma. 
CoRR, abs/0903.0544, 2009. 

[14] J. Naor and M. Naor. Small-Bias Probability Spaces: Efficient Constructions and Ap- 
plications. SI AM J. Comput, 22(4):838-856, 1993. 

[15] A. Srinivasan. Improved algorithmic versions of the Lovasz Local Lemma. In SODA, 
pages 611-620, 2008. 



18 



